Initial Sampling for Automatic Interactive Data Exploration

نویسندگان

  • Wenzhao Liu
  • Yanlei Diao
  • Anna Liu
چکیده

In many real world applications, users might not know the queries to send to a database in order to retrieve data in the user-interested areas. Users can apply a trial and error method to discover the queries. However, as the data set is usually quite large, the discovery of queries will take a long time and the whole process is labor-intensive. We want to build a discovery-oriented, interactive data exploration system, that guides users to their interested data areas through interactive sample labeling process. In each iteration, the system will strategically select some sample points to present to users for feedback, as relevant or irrelevant, and finally converge to a query that is able to retrieve all the data in the user-interested area.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Framework for Knowledge-based Interactive Data Exploration

In this paper, we propose a framework that combines the functionality of data exploration and automatic presentation systems to create a knowledge-based, interactive, data exploration system. The purpose of a data exploration system is to enable users to uncover and extract relationships hidden in large data sets. The purpose of an automatic presentation system is to reduce the need for users a...

متن کامل

Visibility-difference entropy for automatic transfer function generation

Direct volume rendering allows for interactive exploration of volumetric data and has become an important tool in many visualization domains. But the insight and information that can be obtained are dependent on the transfer function defining the transparency of voxels. Constructing good transfer functions is one of the most time consuming and cumbersome tasks in volume visualization. We presen...

متن کامل

Framework for Interactive Million-Neuron Simulation Running title: Interactive Million Neuron Simulation

Large simulations have become increasingly complex in many fields, tending to incorporate scaledependent modeling and algorithms and wide-ranging physical influences. This scale of simulation sophistication has not yet been matched in neuroscience. In this paper we describe a framework aimed at enabling natural interaction with complex simulations: their configuration, initial conditions, monit...

متن کامل

Combining User Interaction, Speculative Query Execution and Sampling in the DICE System

The interactive exploration of data cubes has become a popular application, especially over large datasets. In this paper, we present DICE, a combination of a novel frontend query interface and distributed aggregation backend that enables interactive cube exploration. DICE provides a convenient, practical alternative to the typical offline cube materialization strategy by allowing the user to e...

متن کامل

AIDE: An Automated Sample-based Approach for Interactive Data Exploration

In this paper, we argue that database systems be augmented with an automated data exploration service that methodically steers users through the data in a meaningful way. Such an automated system is crucial for deriving insights from complex datasets found in many big data applications such as scientific and healthcare applications as well as for reducing the human effort of data exploration. T...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016